Design of the Comprehensive Fold Recognition Benchmark. Application to SeqFold, Training and Validation

نویسنده

  • Krzysztof A. Olszewski
چکیده

Recent exponential increase of protein sequences creates a challenge for automated annotation methods. When sequence based methods (e.g. PSIBLAST [1]) fail to identify a possible homologue (generally below 25% of protein identity i.e. within so-called twilight zone), fold recognition methods offers additional sensitivity [2,4,5,8]. However, training, validating and comparing fold recognition performance proves to be difficult. This work presents a comprehensive attempt to design a universal, two-tier, balanced fold recognition benchmark that can be used to perform all above mentioned tasks. Also, results of training and comparison of different fold recognition scoring strategies are discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Long-term Streamflow Forecasting by Adaptive Neuro-Fuzzy Inference System Using K-fold Cross-validation: (Case Study: Taleghan Basin, Iran)

Streamflow forecasting has an important role in water resource management (e.g. flood control, drought management, reservoir design, etc.). In this paper, the application of Adaptive Neuro Fuzzy Inference System (ANFIS) is used for long-term streamflow forecasting (monthly, seasonal) and moreover, cross-validation method (K-fold) is investigated to evaluate test-training data in the model.Then,...

متن کامل

Raters’ Perception and Expertise in Evaluating Second Language Compositions

The consideration of rater training is very important in construct validation of a writing test because it is through training that raters are adapted to the use of students’ writing ability instead of their own criteria for assessing compositions (Charney, 1984). However, although training has been discussed in the literature of writing assessment, there is little research regarding raters’ pe...

متن کامل

Face Recognition using Eigenfaces , PCA and Supprot Vector Machines

This paper is based on a combination of the principal component analysis (PCA), eigenface and support vector machines. Using N-fold method and with respect to the value of N, any person’s face images are divided into two sections. As a result, vectors of training features and test features are obtain ed. Classification precision and accuracy was examined with three different types of kernel and...

متن کامل

A comprehensive benchmark between two filter-based multiple-point simulation algorithms

Computer graphics offer various gadgets to enhance the reconstruction of high-order statistics that are not correctly addressed by the two-point statistics approaches. Almost all the newly developed multiple-point geostatistics (MPS) algorithms, to some extent, adapt these techniques to increase the simulation accuracy and efficiency. In this work, a scrutiny comparison between our recently dev...

متن کامل

Design and Validation of Stress Management Training Program for First-Grade Female High School Teachers in Shahriar City

Introduction: The purpose of this study was to design a stress management training program and determine its effectiveness on the quality of work life, career engagement and professional ethics for high school teachers. Methodology: The research was done in a mixed method in two stages. The first stage was qualitative that training package was created and the second stage was quantitative, impl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000